Kernel methods for Multi-labelled classification and Categorical regression problems
نویسندگان
چکیده
This report presents a SVM like learning system to handle multi-label problems. Such problems arise naturally in bio-informatics. Consider for instance the MIPS Yeast genome database in [12], it is formed by around 3,300 genes associated to their functional classes. One gene can have many classes, and different genes do not belong to the same number of functional categories. Such a problem can not be solved directly with classical approaches and it is generally decomposed into many two-class problems. The binary decomposition has been done partially by different researchers [12] on the Yeast dataset but it does not provide a satisfactory answer. We explore in this report a new direct approach. It is based on a large margin ranking system that shares a lot of common properties with Support Vector Machines. We tested it on a toy problem and on real datasets with positive results. We also present a new method to do feature selection with multi-labelled datasets. Our method is based on the multiplicative update rule defined in [17].
منابع مشابه
A kernel method for multi-labelled classification
This article presents a Support Vector Machine (SVM) like learning system to handle multi-label problems. Such problems are usually decomposed into many two-class problems but the expressive power of such a system can be weak [5, 7]. We explore a new direct approach. It is based on a large margin ranking system that shares a lot of common properties with SVMs. We tested it on a Yeast gene funct...
متن کاملMULTI CLASS BRAIN TUMOR CLASSIFICATION OF MRI IMAGES USING HYBRID STRUCTURE DESCRIPTOR AND FUZZY LOGIC BASED RBF KERNEL SVM
Medical Image segmentation is to partition the image into a set of regions that are visually obvious and consistent with respect to some properties such as gray level, texture or color. Brain tumor classification is an imperative and difficult task in cancer radiotherapy. The objective of this research is to examine the use of pattern classification methods for distinguishing different types of...
متن کاملSparse Kernel Orthonormalized PLS for feature extraction in large data sets
We propose a kernel extension of Orthonormalized PLS for feature extraction, within the framework of Kernel Multivariate Analysis (KMVA) KMVA methods have dense solutions and, therefore, scale badly for large datasets By imposing sparsity, we propose a modified KOPLS algorithm with reduced complexity (rKOPLS) The resulting scheme is a powerful feature extractor for regression and classification...
متن کاملComposite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
متن کاملOptimal Categorical Attribute Transformation for Granularity Change in Relational Databases for Binary Decision Problems in Educational Data Mining
This paper presents an approach for transforming data granularity in hierarchical databases for binary decision problems by applying regression to categorical attributes at the lower grain levels. Attributes from a lower hierarchy entity in the relational database have their information content optimized through regression on the categories ́ histogram trained on a small exclusive labelled sampl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001